Gene symbol | No | Name | SEQ ID Nos (Seq3', Seq5', Ref)
 |  | (autotaxin) |
RELA | 19 | v-rel avian reticuloendotheliosis viral oncogene homolog a (nuclear factor of kappa light polypeptide gene enhancer in b-cells 3 (p65)) | SEQ ID No:42; SEQ ID No:43
ITK | 20 | il2-inducible t-cell kinase | SEQ ID No:44; SEQ ID No:45
TNXB | 21 | tenascin xb | SEQ ID No:46; SEQ ID No:47
CSF1 | 22 | colony stimulating factor 1 (macrophage) | SEQ ID No:48; SEQ ID No:49; SEQ ID No:50
VIL2 | 23 | villin 2 (ezrin) | SEQ ID No:51; SEQ ID No:52; SEQ ID No:53
APC | 24 | adenomatosis polyposis coli | SEQ ID No:54; SEQ ID No:55; SEQ ID No:56
MUC1 | 25 | mucin 1, transmembrane | SEQ ID No:57; SEQ ID No:58
IGF2 | 26 | insulin-like growth factor 2 (somatomedin a) | SEQ ID No:59; SEQ ID No:60; SEQ ID No:61
EMR1 | 27 | egf-like module containing, mucin-like, hormone receptor-like sequence 1 | SEQ ID No:62; SEQ ID No:63; SEQ ID No:64
KIAA0427 | 28 | kiaa0427 gene product | SEQ ID No:65; SEQ ID No:66; SEQ ID No:67
SYK | 29 | spleen tyrosine kinase | SEQ ID No:68; SEQ ID No:69; SEQ ID No:70
IL7R | 30 | interleukin 7 receptor | SEQ ID No:71; SEQ ID No:72
MYC | 31 | v-myc avian myelocytomatosis viral oncogene homolog | SEQ ID No:73; SEQ ID No:74; SEQ ID No:75
GATA3 | 32 | gata-binding protein 3 | SEQ ID No:76; SEQ ID No:77; SEQ ID No:78
GRB7 | 33 | growth factor receptor-bound protein 7 | SEQ ID No:79; SEQ ID No:80; SEQ ID No:81
TOP2B | 34 | topoisomerase (dna) ii beta (180kd) | SEQ ID No:82; SEQ ID No:83
CASP4 | 35 | caspase 4, apoptosis-related cysteine protease | SEQ ID No:84; SEQ ID No:85
TIMP2 | 36 | tissue inhibitor of metalloproteinase 2 | SEQ ID No:86; SEQ ID No:87
DDT | 37 | d-dopachrome tautomerase | SEQ ID No:88; SEQ ID No:89; SEQ ID No:90
PRL | 38 | prolactin | SEQ ID No:91; SEQ ID No:92; SEQ ID No:93

WO 02/46467    PCT/IB01/02811



The invention also concerns a method for detecting
differentially expressed polynucleotide sequences which are
correlated with a cancer, said method comprising:

a) obtaining a polynucleotide sample from a patient; and

b) reacting the sample polynucleotide obtained in step
(a) with a probe immobilized on a solid support wherein said
probe comprises any of the polynucleotide sequences of the
libraries previously described or an expression product
encoded by any of the polynucleotide sequences of said
libraries; and

c) detecting the reaction product of step (b).

The invention also relates to such a method for
detecting differentially expressed polynucleotide sequences
of the invention wherein the amount of reaction product of
step (c) is compared to a control sample.

Preferably the polynucleotide sample isolated from the
patient is RNA or mRNA.

Preferably the polynucleotide sample is cDNA obtained by
reverse transcription of the mRNA.

In a preferred embodiment of the method for detecting
differentially expressed polynucleotide sequences, step
(b) comprises a hybridization of the sample RNA with the
labeled probe.

The method for detecting differentially expressed
polynucleotide sequences is used for detecting, diagnosing,
staging, monitoring, prognosticating, preventing or treating
conditions associated with cancer, namely breast cancer.

The method for detecting differentially expressed
polynucleotide sequences is particularly useful wherein the
product encoded by any of the polynucleotide sequences or






Preferably, the polynucleotide array useful to
classify good and poor prognosis primary breast tumors bears
any combination of at least one polynucleotide sequence
selected among those included in each one of the predefined
polynucleotide sequence sets defined in table 9A together
with at least one polynucleotide sequence selected among
those included in each one of the predefined polynucleotide
sequence sets defined in table 9B.

The present invention also concerns a method for
detecting differentially expressed polynucleotide sequences
that are correlated with a cancer, said method comprising:

a) obtaining a polynucleotide sample from a
patient; and

b) reacting the sample polynucleotide obtained in
step (a) with a probe immobilized on a solid support wherein
said probe comprises any of the polynucleotide sequences of
the libraries previously defined or an expression product
encoded by any of the polynucleotide sequences of the
libraries previously defined; and

c) detecting the reaction product of step (b).

Preferably, the polynucleotide sample obtained at
step (a) is labeled before its reaction at step (b) with the
probe immobilized on a solid support.

The label of the polynucleotide sample is
selected from the group consisting of radioactive,
colorimetric, enzymatic, molecular amplification,
bioluminescent or fluorescent.

In a particular embodiment the reaction product
of step (c) is quantified by further comparison of said
reaction product to a control sample.

In a first embodiment, the polynucleotide sample
isolated from the patient and obtained at step (a) is either
RNA or mRNA.

In another embodiment the polynucleotide sample
isolated from the patient is cDNA obtained by reverse
transcription of the mRNA.

Preferably the reaction step (b) of the method
for detecting differentially expressed polynucleotide
sequences comprises a hybridization of the sample RNA obtained
from the patient with the probe.

Preferably the sample RNA is labeled before
hybridization with the probe and the label is selected from
the group consisting of radioactive, colorimetric, enzymatic,
molecular amplification, bioluminescent or fluorescent.

This method for detecting differentially 
expressed polynucleotide sequences is particularly useful for 
detecting, diagnosing, staging, monitoring, prognosticating, 
preventing or treating conditions associated with cancer, and 
particularly breast cancer. 

The method for detecting differentially expressed
polynucleotide sequences is also particularly useful when the
product encoded by any of the polynucleotide sequences or
subsequences set is involved in a receptor-ligand reaction on
which detection is based.

The present invention also relates to a
method for screening an anti-tumor agent comprising the
above-depicted method for detecting differentially
expressed polynucleotide sequences wherein the sample has
been treated with the anti-tumor agent to be screened.

In a particular embodiment the method for
screening an anti-tumor agent comprises detecting
polynucleotide sequences reacting with at least one library
of polynucleotides or polynucleotide sequences set as
previously defined or of products encoded by said library in
a sample obtained from a patient.

The invention is illustrated by examples detailed
below related to particular experimental results obtained
with selected libraries of polypeptides useful to identify
and distinguish tumor samples from normal ones.

Tumor samples and RNA extraction

To avoid any bias of selection as to the type and
size of the tumors, the RNAs to be tested were prepared from
unselected samples. Samples of primary invasive breast
carcinomas were collected from 34 patients undergoing surgery
at the Institut Paoli-Calmettes. After surgical resection,
the tumors were macrodissected: a section was taken for the
pathologist's diagnosis and an adjacent piece was quickly
frozen in liquid nitrogen for molecular analyses. The median
age of patients at the time of diagnosis was 55 years (range
39-83) and most of them were post-menopausal. Tumors were
classified according to the WHO histological typing of breast
tumors into: 29 ductal carcinomas, 2 lobular carcinomas, 1
mixed ductal and lobular carcinoma, and 2 medullary
carcinomas. They differed in size, less than or equal to 20
mm (n = 13), between 20 and 50 mm (n = 18) or greater than 50
mm (n = 3), axillary lymph node status (negative: 19
tumors, positive: 15 tumors), SBR grading (I: 3 tumors, II:
20 tumors, III: 10 tumors, not evaluable: 1 tumor), and
estrogen receptor (ER) status evaluated by
immunohistochemical assay (23 ER-positive, 11 ER-negative).
The ER positivity cutoff value was 10%. Adjuvant treatment with
radiotherapy and, when necessary, multi-agent anthracyclin-based
chemotherapy (n = 16) was given to patients according
to local practice.

Total RNA was extracted from tumor samples by
standard methods (43). Total RNA from normal breast tissue
was obtained from Clontech (Palo Alto, CA): RNA was isolated
from 8 tissue specimens from Caucasian females, age range
23-47. RNA integrity was controlled by denaturing
formaldehyde agarose gel electrophoresis and Northern blots
using a 28S-specific oligonucleotide.

cDNA arrays preparation

Gene expression was analyzed by hybridization of
arrays with radioactive probes. The arrays contained PCR
products of 5 control clones, and 180 IMAGE human cDNA clones
selected with practical criteria (3' sequence of mRNA, same
cloning vector, host bacteria and insert size). This
represented 176 genes (4 genes were represented by 2
different clones): 121 with proven or putative implication
in cancer and 55 implicated in immune reactions (the list is
available on the web site:
http://tagc.univ-mrs.fr/pub/cancer/). Their identity was
verified by 5' tag-sequencing of plasmid DNA and comparison
with sequences in the EST (dbEST) and nucleotide (GenBank)
databases at the NCBI. Identity was confirmed for all but 14
clones without significant gene similarity, which were
referenced by their GenBank accession number. The control
clones were: Arabidopsis thaliana cytochrome c554 gene (used
for hybridization signal normalization), 3 poly(A) sequences
of different sizes and the vector pT7T3D (negative controls).

PCR amplification, purification and robotic
spotting of PCR products onto Hybond-N+ membranes (Amersham)
were done according to described protocols (4). All PCR
products were spotted in duplicate. For normalization
purposes, the c554 gene was spotted 96-fold scattered over the
whole membrane.

cDNA array hybridizations

Hybridizations were done successively with a
vector oligonucleotide (to precisely determine the amount of
target DNA accessible to hybridization in each spot), then,
after stripping of the vector probe, with complex probes made
from the RNAs (4). Each complex probe was hybridized to a
distinct filter. Probes were prepared from total RNA with an
excess of oligo(dT25) to saturate the poly(A) tails of the
messengers, and to ensure that the reverse transcribed
product did not contain long poly(T) sequences. A precise
amount of c554 mRNA was added to the total RNA before
labeling to allow normalization of the data.

Five µg of total RNA (~100 ng of mRNA) from tissue
samples were used for each labeling. Probe preparation and
hybridization of the membranes were done according to known
procedures (http://tagc.univ-mrs.fr/pub/Cancer/).

Hybridization was done in excess of target (~15
ng of DNA in each spot) and binding of cDNAs to the targets
was linear and proportional to the quantity of cDNA in the
probe.

Detection and quantification of cDNA array
hybridization signals

Quantitative data were obtained using an imaging
plate device. Hybridization signal detection with a FUJI BAS
1500 machine and quantification with the HDG Analyzer
software (Genomic Solutions, Ann Arbor, MI) were done as
previously described (http://tagc.univ-mrs.fr/pub/Cancer/).
Quantification was done by integrating all spot pixel
intensities and subtracting a spot background value
determined in the neighboring area. Spots were located with a
Laplacian transformation. Spot background level was the
median intensity of all the pixels present in a small window
centered on the spot and which were not part of any spot
(44). Quantified data were normalized in three steps and
expressed as absolute gene expression levels (i.e. in
percentage of abundance of individual mRNA with respect to
mRNA within the sample), as described (4).
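
The background-subtraction step described above can be sketched as follows. This is a minimal illustration, not the HDG Analyzer implementation: the function name, the window representation and the choice to subtract the background level once per spot pixel are assumptions made for the example.

```python
from statistics import median


def quantify_spot(image, spot_pixels, window, all_spot_pixels):
    """Return a background-subtracted integrated intensity for one spot.

    image: 2D list of pixel intensities (list of rows).
    spot_pixels: set of (row, col) pixels belonging to this spot.
    window: (row_min, row_max, col_min, col_max), inclusive bounds of a
        small window centered on the spot (hypothetical representation).
    all_spot_pixels: set of (row, col) pixels belonging to any spot.
    """
    r0, r1, c0, c1 = window
    # Background pixels: inside the window but not part of any spot.
    background = [
        image[r][c]
        for r in range(r0, r1 + 1)
        for c in range(c0, c1 + 1)
        if (r, c) not in all_spot_pixels
    ]
    bg_level = median(background)
    # Integrate all spot pixel intensities.
    raw = sum(image[r][c] for (r, c) in spot_pixels)
    # Assumption: the background level is removed once per spot pixel.
    return raw - bg_level * len(spot_pixels)


# Toy 4x4 image: uniform background of 10 with a 2x2 spot of intensity 50.
image = [
    [10, 10, 10, 10],
    [10, 50, 50, 10],
    [10, 50, 50, 10],
    [10, 10, 10, 10],
]
spot = {(1, 1), (1, 2), (2, 1), (2, 2)}
signal = quantify_spot(image, spot, (0, 3, 0, 3), spot)
```

Using the median rather than the mean of the surrounding pixels keeps the background estimate robust to a few bright pixels from neighboring spots.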

Array data analysis

Before analysis of the results, the
reproducibility of the experiments was verified by comparing
duplicate spots, or one hybridization with the same probe on
two independent arrays, or two independent hybridizations
with probes prepared from the same RNA. In every case, the
results showed good reproducibility with respective
correlation coefficients of 0.95, 0.98 and 0.98 (data not
shown). Moreover, genes represented by two different clones
on the array, such as CDK4 or ETV5, displayed similar
expression profiles for the two clones in all samples. This
reproducibility was sufficient to consider a 2-fold
expression difference as significantly differential.
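
As an illustration of the reproducibility check and the 2-fold criterion, the sketch below computes a Pearson correlation coefficient between two series of measurements and flags a pair of expression levels as differential. The function names and the handling of zero values are assumptions for the example, not part of the described procedure.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def is_differential(level_a, level_b, fold=2.0):
    """True when two expression levels differ by at least `fold`."""
    high, low = max(level_a, level_b), min(level_a, level_b)
    if low <= 0:
        # Assumption: a gene detected in only one sample counts as differential.
        return high > 0
    return high / low >= fold


# Hypothetical duplicate-spot intensities for three clones.
r = pearson_r([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
```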

For graphical representation, data were displayed
as absolute expression levels (Fig. 2a). For better
visualization of clustering, results were log-transformed and
displayed as relative values median-centered in each row and
in each column (Fig. 2b). Hierarchical clustering was
applied to the tissue samples and the genes using the Cluster
program developed by Eisen (45) (average linkage clustering
using Pearson correlation as similarity metric). Results in
Figs. 2 and 3 were displayed with the TreeView program (45).
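
The log transformation and median-centering described above can be sketched as below. This is only an illustration under stated assumptions: the logarithm base (2 here) is not given in the text, all values are assumed strictly positive, and a single pass of row then column centering is shown; it is not the code of the Cluster program.

```python
from math import log2
from statistics import median


def log_median_center(matrix):
    """Log-transform a genes x samples matrix, then median-center rows and columns."""
    logged = [[log2(value) for value in row] for row in matrix]
    # Center each row (gene) on its own median.
    row_centered = []
    for row in logged:
        row_median = median(row)
        row_centered.append([value - row_median for value in row])
    # Center each column (sample) on its own median.
    col_medians = [median(col) for col in zip(*row_centered)]
    return [
        [value - col_median for value, col_median in zip(row, col_medians)]
        for row in row_centered
    ]


# Two hypothetical genes measured in three samples.
centered = log_median_center([[1, 2, 4], [2, 4, 8]])
```

After centering, each displayed value is relative to the typical level of its gene and of its sample, which is what makes the clustered patterns easier to see.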

Subsequent analysis was done using Excel software 
(Microsoft) and statistical analyses with the SPSS software. 
Metastasis-free survival and overall survival were measured 
from diagnosis until the first metastatic relapse or death 
respectively. They were estimated with the Kaplan-Meier 
method and compared between groups with the Log-Rank test. 
Correlations of gene pairs based on expression profiles were 
measured with the correlation coefficient r. The search for 
genes with expression levels correlated with tumor parameters 
was done in several successive steps. 

First, genes were detected by comparing their
median expression level in the two subgroups of tumors
discordant according to the parameter of interest. The median
values rather than the mean values were used because of the
high variability of the expression levels for many genes,
resulting in a standard deviation of expression level similar
or superior to the mean value and making comparisons with
means impossible. Second, these detected genes were
inspected visually on graphics, and finally, an appropriate
statistical analysis was applied to those that were
convincing to validate the correlation. Comparison of GATA3
expression between ER-positive tumors and ER-negative tumors
was validated using a Mann-Whitney test. Correlation
coefficients were used to compare the gene expression levels
to the number of axillary nodes involved.
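
The first screening step (comparison of medians) and the statistic underlying a Mann-Whitney test can be sketched as follows. The function names and the toy values are hypothetical; the sketch only computes the U statistic, and a p-value would in practice come from a statistics package such as SPSS, as used in the text, or `scipy.stats.mannwhitneyu`.

```python
from statistics import median


def median_ratio(group_a, group_b):
    """Ratio of median expression levels between two tumor subgroups."""
    return median(group_a) / median(group_b)


def mann_whitney_u(group_a, group_b):
    """U statistic for group_a: pairs where a > b count 1, ties count 0.5."""
    u = 0.0
    for a in group_a:
        for b in group_b:
            if a > b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u


# Hypothetical expression levels of one gene in two subgroups.
positive = [5.0, 6.0, 7.0]
negative = [1.0, 2.0, 3.0]
ratio = median_ratio(positive, negative)
u_stat = mann_whitney_u(positive, negative)
```

Both quantities depend only on the ordering or the central value of the data, which matches the stated reason for preferring medians when the standard deviation is as large as the mean.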

Northern blot analysis

Seventy-nine breast tumors, including 22 of the
34 tested on the arrays, were analyzed for GATA3 expression
by Northern blot hybridization. RNA extraction from tumor
samples and Northern blots were done as previously described

CLAIMS



1. A polynucleotide library useful in the
molecular characterization of a carcinoma, said library
comprising a pool of polynucleotide sequences or subsequences
thereof wherein said sequences or subsequences are either
underexpressed or overexpressed in tumor cells, further wherein
said sequences or subsequences correspond substantially to
any of the polynucleotide sequences set forth in any of SEQ
ID Nos: 1 - 468 or the complement thereof.

2. A polynucleotide library according to Claim 1
wherein said polynucleotide sequences or subsequences thereof
of said pool correspond to any combination of at least one
polynucleotide selected among those included in any one of the
following predefined sets:

SET 1: (SEQ ID No:1; SEQ ID No:2); SET 2: (SEQ ID No:3; SEQ ID No:4);
SET 3: (SEQ ID No:5; SEQ ID No:6); SET 4: (SEQ ID No:7; SEQ ID No:8);
SET 5: (SEQ ID No:9; SEQ ID No:10); SET 6: (SEQ ID No:11; SEQ ID No:12);
SET 7: (SEQ ID No:13; SEQ ID No:14; SEQ ID No:15); SET 8: (SEQ ID No:16);
SET 9: (SEQ ID No:17; SEQ ID No:18; SEQ ID No:19);
SET 10: (SEQ ID No:20; SEQ ID No:21);
SET 11: (SEQ ID No:22; SEQ ID No:23; SEQ ID No:24);
SET 12: (SEQ ID No:25; SEQ ID No:26);
SET 13: (SEQ ID No:27; SEQ ID No:28; SEQ ID No:29);
SET 14: (SEQ ID No:30; SEQ ID No:31);
SET 15: (SEQ ID No:32; SEQ ID No:33; SEQ ID No:34);
SET 16: (SEQ ID No:35);
SET 17: (SEQ ID No:36; SEQ ID No:37; SEQ ID No:38);
SET 18: (SEQ ID No:39; SEQ ID No:40; SEQ ID No:41);
SET 19: (SEQ ID No:42; SEQ ID No:43);
SET 20: (SEQ ID No:44; SEQ ID No:45);
SET 21: (SEQ ID No:46; SEQ ID No:47);
SET 22: (SEQ ID No:48; SEQ ID No:49; SEQ ID No:50);
SET 23: (SEQ ID No:51; SEQ ID No:52; SEQ ID No:53);
SET 24: (SEQ ID No:54; SEQ ID No:55; SEQ ID No:56);
SET 25: (SEQ ID No:57; SEQ ID No:58);
SET 26: (SEQ ID No:59; SEQ ID No:60; SEQ ID No:61);
SET 27: (SEQ ID No:62; SEQ ID No:63; SEQ ID No:64);
SET 28: (SEQ ID No:65; SEQ ID No:66; SEQ ID No:67);
SET 29: (SEQ ID No:68; SEQ ID No:69; SEQ ID No:70);
SET 30: (SEQ ID No:71; SEQ ID No:72);
SET 31: (SEQ ID No:73; SEQ ID No:74; SEQ ID No:75);
SET 32: (SEQ ID No:76; SEQ ID No:77; SEQ ID No:78);
SET 33: (SEQ ID No:79; SEQ ID No:80; SEQ ID No:81);
SET 34: (SEQ ID No:82; SEQ ID No:83);
SET 35: (SEQ ID No:84; SEQ ID No:85);
SET 36: (SEQ ID No:86; SEQ ID No:87);
SET 37: (SEQ ID No:88; SEQ ID No:89; SEQ ID No:90);
SET 38: (SEQ ID No:91; SEQ ID No:92; SEQ ID No:93);
SET 39: (SEQ ID No:94; SEQ ID No:95; SEQ ID No:96);
SET 40: (SEQ ID No:97; SEQ ID No:98; SEQ ID No:99);
SET 41: (SEQ ID No:100; SEQ ID No:101; SEQ ID No:78);
SET 42: (SEQ ID No:102; SEQ ID No:103);
SET 43: (SEQ ID No:104; SEQ ID No:105);
SET 44: (SEQ ID No:106; SEQ ID No:107; SEQ ID No:108);
SET 45: (SEQ ID No:109; SEQ ID No:110);
SET 46: (SEQ ID No:111; SEQ ID No:112; SEQ ID No:113);
SET 47: (SEQ ID No:114);
SET 48: (SEQ ID No:115; SEQ ID No:116; SEQ ID No:117);
SET 49: (SEQ ID No:118; SEQ ID No:119);
SET 50: (SEQ ID No:120; SEQ ID No:121);
SET 51: (SEQ ID No:122; SEQ ID No:78);
SET 52: (SEQ ID No:123; SEQ ID No:124; SEQ ID No:125);
SET 53: (SEQ ID No:126; SEQ ID No:127; SEQ ID No:128);
SET 54: (SEQ ID No:129; SEQ ID No:130);
SET 55: (SEQ ID No:131; SEQ ID No:132);
SET 56: (SEQ ID No:133; SEQ ID No:134);
SET 57: (SEQ ID No:135; SEQ ID No:136; SEQ ID No:137);
SET 58: (SEQ ID No:138; SEQ ID No:139; SEQ ID No:140);
SET 59: (SEQ ID No:141; SEQ ID No:142; SEQ ID No:143);
SET 60: (SEQ ID No:144; SEQ ID No:145; SEQ ID No:146);
SET 61: (SEQ ID No:147; SEQ ID No:148; SEQ ID No:149);
SET 62: (SEQ ID No:150; SEQ ID No:151; SEQ ID No:152);
SET 63: (SEQ ID No:153; SEQ ID No:154; SEQ ID No:155);
SET 64: (SEQ ID No:156; SEQ ID No:157; SEQ ID No:158);
SET 65: (SEQ ID No:159; SEQ ID No:160; SEQ ID No:161);
SET 66: (SEQ ID No:162; SEQ ID No:163);
SET 67: (SEQ ID No:164; SEQ ID No:165);
SET 68: (SEQ ID No:166; SEQ ID No:167; SEQ ID No:152);
SET 69: (SEQ ID No:168; SEQ ID No:169; SEQ ID No:170);
SET 70: (SEQ ID No:171; SEQ ID No:172);
SET 71: (SEQ ID No:173; SEQ ID No:174; SEQ ID No:175);
SET 72: (SEQ ID No:176; SEQ ID No:177);
SET 73: (SEQ ID No:178; SEQ ID No:179);
SET 74: (SEQ ID No:180; SEQ ID No:181; SEQ ID No:182);
SET 75: (SEQ ID No:183; SEQ ID No:184);
SET 76: (SEQ ID No:185); SET 77: (SEQ ID No:186);
SET 78: (SEQ ID No:187; SEQ ID No:188); SET 79: (SEQ ID No:189;
SEQ ID No:190; SEQ ID No:191); SET 80: (SEQ ID No:192; SEQ ID No:193);
SET 81: (SEQ ID No:194; SEQ ID No:195);
SET 82: (SEQ ID No:196; SEQ ID No:197; SEQ ID No:198);
SET 83: (SEQ ID No:199; SEQ ID No:200);
SET 84: (SEQ ID No:201; SEQ ID No:202; SEQ ID No:203);
SET 85: (SEQ ID No:204; SEQ ID No:205);
SET 86: (SEQ ID No:206; SEQ ID No:207);
SET 87: (SEQ ID No:208; SEQ ID No:209);
SET 88: (SEQ ID No:210; SEQ ID No:211);
SET 89: (SEQ ID No:212; SEQ ID No:213);
SET 90: (SEQ ID No:214; SEQ ID No:215);
SET 91: (SEQ ID No:216; SEQ ID No:217);
SET 92: (SEQ ID No:218; SEQ ID No:219; SEQ ID No:220);
SET 93: (SEQ ID No:221; SEQ ID No:222);
SET 94: (SEQ ID No:223; SEQ ID No:224; SEQ ID No:225);
SET 95: (SEQ ID No:226; SEQ ID No:227);
SET 96: (SEQ ID No:228; SEQ ID No:229);
SET 97: (SEQ ID No:230; SEQ ID No:231; SEQ ID No:232);
SET 98: (SEQ ID No:233; SEQ ID No:234);
SET 99: (SEQ ID No:235; SEQ ID No:236; SEQ ID No:237);
SET 100: (SEQ ID No:238; SEQ ID No:239);
SET 101: (SEQ ID No:240; SEQ ID No:241);
SET 102: (SEQ ID No:242; SEQ ID No:243; SEQ ID No:244);
SET 103: (SEQ ID No:245; SEQ ID No:246; SEQ ID No:247);
SET 104: (SEQ ID No:248; SEQ ID No:249);
SET 105: (SEQ ID No:250; SEQ ID No:251; SEQ ID No:252);
SET 106: (SEQ ID No:253; SEQ ID No:254);
SET 107: (SEQ ID No:255; SEQ ID No:256);
SET 108: (SEQ ID No:257; SEQ ID No:258);
SET 109: (SEQ ID No:259; SEQ ID No:260; SEQ ID No:261);
SET 110: (SEQ ID No:262; SEQ ID No:200);
SET 111: (SEQ ID No:263; SEQ ID No:264);
SET 112: (SEQ ID No:265; SEQ ID No:266);
SET 113: (SEQ ID No:267; SEQ ID No:268);
SET 114: (SEQ ID No:269; SEQ ID No:270);
SET 115: (SEQ ID No:271; SEQ ID No:272);
SET 116: (SEQ ID No:273; SEQ ID No:274);
SET 117: (SEQ ID No:275; SEQ ID No:276);
SET 118: (SEQ ID No:277; SEQ ID No:278);
SET 119: (SEQ ID No:279; SEQ ID No:280; SEQ ID No:281);
SET 120: (SEQ ID No:282; SEQ ID No:283; SEQ ID No:276);
SET 121: (SEQ ID No:284; SEQ ID No:285);
SET 122: (SEQ ID No:286; SEQ ID No:287; SEQ ID No:288);
SET 123: (SEQ ID No:289; SEQ ID No:290);
SET 124: (SEQ ID No:291; SEQ ID No:292);
SET 125: (SEQ ID No:293; SEQ ID No:294; SEQ ID No:295);
SET 126: (SEQ ID No:296; SEQ ID No:297);
SET 127: (SEQ ID No:298; SEQ ID No:299; SEQ ID No:300);
SET 128: (SEQ ID No:301; SEQ ID No:302; SEQ ID No:288);
SET 129: (SEQ ID No:303; SEQ ID No:304);
SET 130: (SEQ ID No:305; SEQ ID No:306;
SEQ ID No:307); SET 131: (SEQ ID No:308; SEQ ID No:309; SEQ ID No:310);
SET 132: (SEQ ID No:311; SEQ ID No:312; SEQ ID No:313);
SET 133: (SEQ ID No:314; SEQ ID No:315; SEQ ID No:316);
SET 134: (SEQ ID No:317; SEQ ID No:318);
SET 135: (SEQ ID No:319; SEQ ID No:320; SEQ ID No:321);
SET 136: (SEQ ID No:322; SEQ ID No:323);
SET 137: (SEQ ID No:324; SEQ ID No:325);
SET 138: (SEQ ID No:326; SEQ ID No:327; SEQ ID No:328);
SET 139: (SEQ ID No:329; SEQ ID No:330);
SET 140: (SEQ ID No:331; SEQ ID No:332; SEQ ID No:333);
SET 141: (SEQ ID No:334; SEQ ID No:335; SEQ ID No:336);
SET 142: (SEQ ID No:337; SEQ ID No:338; SEQ ID No:117);
SET 143: (SEQ ID No:339; SEQ ID No:340; SEQ ID No:341);
SET 144: (SEQ ID No:342; SEQ ID No:343; SEQ ID No:344);
SET 145: (SEQ ID No:345; SEQ ID No:346);
SET 146: (SEQ ID No:347; SEQ ID No:348; SEQ ID No:349);
SET 147: (SEQ ID No:350; SEQ ID No:351);
SET 148: (SEQ ID No:352; SEQ ID No:353);
SET 149: (SEQ ID No:354; SEQ ID No:355);
SET 150: (SEQ ID No:356; SEQ ID No:357);
SET 151: (SEQ ID No:358; SEQ ID No:359; SEQ ID No:360);
SET 152: (SEQ ID No:361; SEQ ID No:31);
SET 153: (SEQ ID No:362; SEQ ID No:363; SEQ ID No:364);
SET 154: (SEQ ID No:365; SEQ ID No:366; SEQ ID No:367);
SET 155: (SEQ ID No:368; SEQ ID No:369; SEQ ID No:300);
SET 156: (SEQ ID No:370; SEQ ID No:371);
SET 157: (SEQ ID No:372; SEQ ID No:373; SEQ ID No:108);
SET 158: (SEQ ID No:374; SEQ ID No:375; SEQ ID No:376);
SET 159: (SEQ ID No:377; SEQ ID No:378; SEQ ID No:379);
SET 160: (SEQ ID No:380; SEQ ID No:381);
SET 161: (SEQ ID No:382; SEQ ID No:383; SEQ ID No:384);
SET 162: (SEQ ID No:385; SEQ ID No:386; SEQ ID No:387);
SET 163: (SEQ ID No:388; SEQ ID No:389; SEQ ID No:390);
SET 164: (SEQ ID No:391; SEQ ID No:392; SEQ ID No:393);
SET 165: (SEQ ID No:394; SEQ ID No:395);
SET 166: (SEQ ID No:396; SEQ ID No:397; SEQ ID No:398);
SET 167: (SEQ ID No:399; SEQ ID No:400; SEQ ID No:117);
SET 168: (SEQ ID No:401);
SET 169: (SEQ ID No:402; SEQ ID No:403);
SET 170: (SEQ ID No:404; SEQ ID No:405; SEQ ID No:318);
SET 171: (SEQ ID No:406; SEQ ID No:407; SEQ ID No:408);
SET 172: (SEQ ID No:409; SEQ ID No:410; SEQ ID No:411);
SET 173: (SEQ ID No:412; SEQ ID No:413);
SET 174: (SEQ ID No:414; SEQ ID No:415; SEQ ID No:416);
SET 175: (SEQ ID No:417; SEQ ID No:418; SEQ ID No:419);
SET 176: (SEQ ID No:420; SEQ ID No:421; SEQ ID No:422);
SET 177: (SEQ ID No:423;
SEQ ID No:424; SEQ ID No:425);
SET 178: (SEQ ID No:426; SEQ ID No:427; SEQ ID No:428);
SET 179: (SEQ ID No:429; SEQ ID No:408);
SET 180: (SEQ ID No:430); SET 181: (SEQ ID No:431);
SET 182: (SEQ ID No:432);
SET 183: (SEQ ID No:433; SEQ ID No:434);
SET 184: (SEQ ID No:435; SEQ ID No:436);
SET 185: (SEQ ID No:437);
SET 186: (SEQ ID No:438; SEQ ID No:439);
SET 187: (SEQ ID No:440; SEQ ID No:441);
SET 188: (SEQ ID No:442); SET 189: (SEQ ID No:444);
SET 190: (SEQ ID No:445);
SET 191: (SEQ ID No:446; SEQ ID No:447);
SET 192: (SEQ ID No:448); SET 193: (SEQ ID No:449);
SET 194: (SEQ ID No:450); SET 195: (SEQ ID No:451);
SET 196: (SEQ ID No:452); SET 197: (SEQ ID No:453);
SET 198: (SEQ ID No:454); SET 199: (SEQ ID No:455);
SET 200: (SEQ ID No:456); SET 201: (SEQ ID No:457);
SET 202: (SEQ ID No:458); SET 203: (SEQ ID No:459);
SET 204: (SEQ ID No:460); SET 205: (SEQ ID No:461);
SET 206: (SEQ ID No:462); SET 207: (SEQ ID No:463);
SET 208: (SEQ ID No:464); SET 209: (SEQ ID No:465);
SET 210: (SEQ ID No:466); SET 211: (SEQ ID No:467);
SET 212: (SEQ ID No:468).

3. A polynucleotide library according to Claim 
2 wherein said polynucleotide sequences or subsequences 
thereof of said pool correspond to any combination of at 
least one polynucleotide selected among those included in at 
least 50%, preferably 75% and more preferably 100% of the 
predefined sets. 

4. A library according to any one of Claims 1 or 2
wherein the pool of polynucleotide sequences or subsequences
corresponds substantially to any combination of at least one
polynucleotide sequence selected among those included in each
one of the predefined polynucleotide sequence sets comprising:

SET 1: (SEQ ID No:1; SEQ ID No:2); SET 4: (SEQ ID No:7; SEQ ID No:8);
SET 18: (SEQ ID No:39; SEQ ID No:40; SEQ ID No:41);
SET 21: (SEQ ID No:46; SEQ ID No:47);
SET 24: (SEQ ID No:54; SEQ ID No:55; SEQ ID No:56);
SET 32: (SEQ ID No:76; SEQ ID No:77; SEQ ID No:78);
SET 38: (SEQ ID No:91; SEQ ID

26. A polynucleotide library according to Claim
25 wherein the support is selected from the group comprising
a nylon membrane, nitrocellulose membrane, glass slide, glass
beads, membranes on glass support or a silicon chip.

27. A polynucleotide array useful for
prognosis or diagnosis of a tumor comprising an immobilized
polynucleotide library according to Claims 1 to 3.

28. A polynucleotide array useful to
differentiate a normal cell from a cancer cell comprising any
combination of immobilized polynucleotide sequence sets
according to claims 4 to 7.

29. A polynucleotide array useful to detect a
hormone sensitive tumor cell comprising any combination of
immobilized polynucleotide sequence sets according to claims
8 to 11.

30. A polynucleotide array useful to
differentiate a tumor with lymph nodes from a tumor without
lymph nodes comprising any combination of immobilized
polynucleotide sequence sets according to claims 12 to 15.

31. A polynucleotide array useful to
differentiate anthracycline-sensitive tumors from
anthracycline-insensitive tumors comprising any combination of
immobilized polynucleotide sequence sets according to claims
16 to 19.



32. A polynucleotide array useful to classify 
good and poor prognosis primary breast tumors comprising any 